Cure: Clustering on Sequential Data for Web Personalization: Tests and Experimental Results

نویسندگان

  • K. Santhisree
  • A. Damodaram
چکیده

CURE: CLUSTERING ON SEQUENTIAL DATA FOR WEB PERSONALIZATION: TESTS AND EXPERIMENTAL RESULTS K. Santhisree, and A. Damodaram Department of Computer Science, Jawaharlal Nehru Technological University, Hyderabad E-mail: [email protected], E-mail: [email protected] The world wide web is full of multi-disciplinary data for knowledge data discovery research. In this paper we present CURE (Clustering usage Representatives) algorithm to find clusters on a web usage data. We adopted data from MSNBC.COM website which is a free news data website with different categories of news and subjects. After generating the clusters by CURE algorithm, average of inter cluster and intra cluster are calculated and the results are compared with different similarity measures like Euclidean, Jaccard, projected Euclidean, cosine and fuzzy similarity. Finally behavior of clusters that made by CURE algorithm showed on a sequential data in a web usage domain with quantify our results by the way of explanations and list conclusions.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Using Sequential and Non-Sequential Patterns in Predictive Web Usage Mining Tasks

We describe an efficient framework for Web personalization based on sequential and non-sequential pattern discovery from usage data. Our experimental results performed on real usage data indicate that more restrictive patterns, such as contiguous sequential patterns (e.g., frequent navigational paths) are more suitable for predictive tasks, such as Web prefetching, which involve predicting whic...

متن کامل

Lessons from the Application of Domain-Independent Data Mining System for Discovering Web User Access Patterns

This paper presents the usage of a general domain-independent data mining system in discovering of the Web user access patterns. DB2 Intelligent Miner for Data was successfully used in data mining of a huge Web log which was collected during the World FIFA Cup 1998. The clustering, associations and sequential pattern mining functions were considered in the context of Web usage mining. The clust...

متن کامل

Relational fuzzy approach for mining user profiles

Capturing the characteristics and preferences of Web users into user profiles is a fundamental task to perform in order to implement forms of personalization on a Web site. In this paper, we present a relational fuzzy clustering approach to extract significant user profiles from session data derived from log files. In particular, a modified version of the CARD clustering algorithm is proposed i...

متن کامل

Web User Profiling Using Relational Fuzzy Clustering

User profiling is a fundamental task in Web personalization. In this paper, we use a relational fuzzy clustering to discover user profiles from Web log data. Precisely, a modified version of the CARD algorithm, called CARD+, is proposed to discover clusters embedded in the Web usage data and derive profiles modeling the real user preferences. Experimental results on log data extracted from log ...

متن کامل

Data Mining for Web Personalization

In this chapter we present an overview of Web personalization process viewed as an application of data mining requiring support for all the phases of a typical data mining cycle. These phases include data collection and preprocessing, pattern discovery and evaluation, and finally applying the discovered knowledge in real-time to mediate between the user and the Web. This view of the personaliza...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2011